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Abstract 

We perform a detailed hadron-level study on the sensitivity of Higgs boson pair production via 
the WW*WW* channel with the final state 3£2j + $ at the LHC with the collision energy \fS = 14 
TeV and a future 100 TeV collider. To avoid the huge background from pp —> ZW +jets processes, 
we confine to consider the four lepton patterns: e^e^pZ and /A/AeU We propose a partial 
reconstruction method to determine the most reliable combination. After that, we examine a few 
crucial observables which can discriminate efficiently signal and background events, especially we 
notice that the observable mT 2 is very efficient. For the LHC 14 TeV collisions, with an accumulated 
3000 ftv 1 dataset, we find that the sensitivity of this mode can reach up to 1.5 a for the Standard 
Model and the triple coupling of Higgs boson A 3 in the simplest effective theory can be constrained 
into the range [-1, 8 ] at 95% confidence level; at a 100 TeV collider with the integrated luminosity 
3000 fb _1 , the sensitivity can reach up to 13 a for the Standard Model and we find that all values 
of A 3 in the effective theory can be covered up to 3a even without optimising signals. To precisely 
measure the triple coupling of Higgs boson A 3 = 1 of the Standard Model at a 100 TeV collider, by 
using the invariant mass of three leptons which is robust to against the contamination of underlying 
events and pileup effects and by performing a y 2 analysis, we find that it can be determined into 
a range [0.8, 1.5] at 95% confidence level. 
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I. INTRODUCTION 


The last building block of the Standard Model (SM), Higgs boson, has been discovered 
by ATLAS and CMS Collaborations [HE], The interaction of Higgs boson with the fermions 
of the SM and its self couplings are new types of interactions which are different from those 
described by the gauge symmetries in the SM. To ascertain the nature of Higgs boson, it is 
important to precisely measure the Yukawa type interactions which can be determined by 
measuring the Higgs decay into fermion pairs from single Higgs production at future LHC 
runs and Higgs factories [21 Ej- While the analysis on the Higgs self couplings via Higgs pair 
and multi-Higgs boson production is achievable at high luminosity LHC runs and future pp 
collider, say a 100 TeV collider |5j. 

The determination of the Higgs potential is of important significance, since the potential is 
directly related to the structure of vacuum, the electroweak phase transition and electroweak 
baryongensis, and the fate of our universe as well. It is useful to address the issue whether 
the Higgs boson is elementary or composite. It is also crucial to probe new physics, which is 
believed to exist somewhere and somehow since there are fundamental issues which cannot 
be solved by the SM itself, e.g. the matter-antimatter asymmetry in our universe, the 
quadratic divergence of the Higgs mass term, and the mystery of dark matter, etc. 

The SM predicts trilinear and quartic self couplings in the Higgs potential at tree level. 
Both trilinear and quartic Higgs self couplings are related to the Higgs boson mass by 
m 2 H = ^Asm^ 2 , where trilinear and quartic couplings are proportional to Asm, which is the 
dimensionless coupling of Higgs potential before electroweak symmetry breaking. In the 
language of an effective field theory, the trilinear coupling term can be simply expressed as 



( 1 ) 


where A 3 = 1 corresponds to the SM case and there is relation Ag r M 


3/2Asm in this 


parametrisation. It is well known that to determine the quartic coupling of the SM might 
be challenging at the LHC due to the small production rate of three Higgs boson final state, 
but to detect the trilinear coupling via Higgs pair production is expected to be within the 


reach of the LHC. The measure of the trilinear coupling up to the precision 10% at a future 


100 TeV colliders is feasible [ 6 ], which can further pinpoint and discover new physics. 

The importance of Higgs pair production has attracted attentions long time ago. The¬ 
oretical investigations on the Higgs pair production in the SM began with the pioneering 
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works M, where the gluon-gluon fusion [7] and the vector boson fusion M processes 
had been considered. It has been found that at hadron colliders the gluon-gluon fusion 
production is almost one order of magnitude larger than the weak boson fusion process. 
There are lots of effort to improve theoretical prediction on the Higgs pair production. For 
example, the NLO and NNLO QCD corrections to gluon-gluon fusion had been considered 
in [TUI nj and recently in D2J by using the large top mass approximation and normalizing 
the partonic cross section using the exact LO result. The finite top quark mass effects have 
been analysed at NLO in [T3] via expansion by top quark mass. Recently, NNLO QCD 
corrections to the VBF Higgs pair production has been done by the USTC group [13]. 

Besides the detecting of the Higgs self-couplings of the SM, multi-Higgs production at 
various colliders are of great importance to probe new physics, as explored in reference [15j . 
At hadron colliders, Higgs pairs can be enhanced by other heavier scalar resonances [TUIEU] . 
By measuring the signal of Higgs pair production, we can extract the triple Higgs coupling 
and then depict the shape of Higgs potential so as to distinguish various electroweak symme¬ 
try breaking models. For example, the composite models predict a vanishing or small triple 
couplings |2T] and a model with effective potential V = A (H + H) 2 + (H + H) 3 / A 2 predicts 
a triple Higgs coupling 7/3 times that of the SM. The measurement of the cross section 
of Higgs pair production is also important to distinguish models where Higgs is assumed 
to be elementary, like in the supersymmetric model where superparticles can enhance the 
production rate [22l 124] and like in the two-Higgs doublet model the extra scalars can en¬ 
hance the production rate |25j . While the Higgs-Gravity model mum predicts a coupling 
dependent of external momenta. These specific models can be more generally formulated 
and conveniently explored in the framework of the effective Lagrangian up to 0(p 6 )[28j; as 
demonstrated in a recent study in Reference [2Dj. 

A comprehensive study on various productions at the generator level has been recently 
investigated in [30] by using the automatic matrix clement generator Madgraph5. According 
to the study of [30], in the SM the leading contribution to Higgs pair production at the 
LHC and a future 100 TeV collider is via gluon-gluon fusion. The subleading production 
mechanism is via weak vector boson fusion processes PUGS!- The tt associated production 
can become comparable with the weak vector boson fusion production when the collision 
energy is around 100 TeV [30]. The effects of top quark mass in double and triple Higgs 
production at hadron colliders have been studied in [32]. The kinematics of the di-Higgs 
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bosons decay to 6677 have been analyzed in [33] and their effects to the measurement of non¬ 
standard values of A 3 have been explored. Interested readers can refer [30[ 03] for theoretical 
progress in the fixed order QCD calculation for single Higgs and Higgs pair productions, and 
top quark pair production as well. 

Except the theoretical efforts on the Higgs pair production, there are lots of efforts to 
improve theoretical predictions of the Higgs boson decay. For a Higgs boson with mass 
around 125-126 GeV, its main decay final state is bb, and the state-of-art research on its 
partial width is up to O(ag) in [36]. Higgs decaying into other fermion pair have also been 
investigated up to two loop level. The H —>■ gg decay channel is up to N 3 LO QCD in 
in the large top quark mass limit, and the top quark mass effects are analyzed in [38] . 
The partial width of H —> 77 channel is known up to NLO EW and NNLO QCD[39J 00]. 
The decay channel H -7- Z-, is known up to NLO QCD in [4Tj . For the decay channel 
H —>■ WW*, ZZ* —> 4/, 0(as ) and 0(a) corrections have been studied in [02HB]. Interested 
readers can refer [H5H43] for more information on the current status of our understanding to 
Higgs boson. 

Recently the signals of Higgs boson pair production at the LHC have been further studied 
via a few decay channels. A recent theoretical review can be found in mu Em. For example, 
the study of HH —s- 6677 channel can be found in Ref. [51] with a significance of about 
1.5 a for a integrated luminosity 600 fb -1 at LHC with 14 TeV collision energy is assumed. 
A recent search by the CMS collaborations can be found in Ref. m The authors of the 
Ref. [31] updated this study and provided a significance of about 6.46 a for the integrated 
luminosity 3000 fb _1 at the 14 TeV. The study for the HH —)■ bbr + T~ channel can be 
found in Ref. [3T, 53||52|> where the authors of the Ref.[3T]J provided a significance of about 
9.36 cr for 3000fb -1 LHC. The channel HH — > bbW + W~ -7- bbivejj has been studied in 
Ref. [55], where a significance of 3.1 cr for 600 fb -1 LHC have been obtained. The mode 
HH —> bbW + W~ —$■ bbdi'ii'i'ei has been studied in Ref. ra and a significance of about 1.53 
a for 3000 fb _1 the LHC has been achieved. A recent updated study on 4 b jet final state 
can be found in Ref. [56] and a search for new physics by the CMS collaborations can be 
found in Ref. The probe of the VBF Higgs pair production can be found in Ref. [55] - 

The third largest decay fraction channel for Higgs pair is WW*WW* channel. The 
subsequent decay mode 8 j and I6j + $ will be too hard to be found due to huge QCD multi¬ 
jets and W+multi-jets background. The decay mode Z ± Z =F 4 j + $ will be also too hard to 
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be found due to the huge Z( 7 *)+multi-jets and W + W _ +multi-jets background. The decay 
mode 4 1+ missing energy will has a tiny production rate. So only two subsequent decay 
channels are reachable: two same-signed lepton mode / ± / ± 4 j + $ and three leptons mode 
M2 j + $. 

These channels had been taken into account in Ref. [59] [60] with the assumption that the 
Higgs mass is in the range 140 GeV < rrijj < 200 GeV and the WW is the main decay channel 
for Higgs and both W bosons are on shell, where, at parton level, important acceptance cuts 
and some simple kinematic variables, especially the invariant mass of all final states which 
was crucial to suppress the background events of tt + jets and multi-top processes, were 
carefully studied. Considering that the measured Higgs boson mass is 125 GeV or so and 
the branching fraction of Higgs boson decay to WW* is considerably smaller than that 
assumption in Ref. ED], the production rate of signal in this final state is almost one 
order of magnitude smaller and the discovery of the signal in this mode is very challenging. 
Furthermore, not all the W bosons from the decay of Higgs boson can be on-shell which 
makes the signal hard to be distinguished. Therefore it is necessary and quite nontrivial to 
revisit and perform a more detailed analysis by taking all these facts into account. In this 
work, we propose a partial reconstruction procedure of Higgs pair in the final states and 
examine more useful kinematic variables especially the m,T 2 variable in our analysis, which 
has been found can suppress most of background efficiently. In order to further improve the 
significance, we also apply two multivariate analysis approaches to optimise the signal and 
background discrimination. 

In this work, we update the study explored in Ref. [60] and consider 3l2j + $ final state 
in more details and will focus on the sensitivity at the LHC and a future 100 TeV collider 
to the triple Higgs coupling. It is confirmed that the background from ZW + jets is huge. 
In order to overcome this type of background, we deliberately consider the four three-lepton 
patterns: e ± e ± / u T and ^i ± /i ± e =F . Since it is essential to reconstruct the crucial information 
of signal events, we propose a partial reconstruction method and an efficient method to fold 
the right combination of Higgs bosons. After the reconstruction, we further construct most 
of kinematical variables, especially the ?7iT2 observable and examine their discrimination 
power to signal and background events. Considering the signal events are few, in order to 
enhance the significance, we apply two multivariate analysis methods to optimise the signal 
and background discrimination. Our results show that this channel can reach to a sensible 
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significance i.e. 1.5 or so, at the 14TeV LHC with 3000fb -1 of integrated luminosity. We 
also extend the study to 100 TeV collisions with a luminosity 3 ab " 1 and find that the mode 
can be used to explore the discovery of all values of A 3 . When the SM will be confirmed, 
this mode can also be used to perform precision measure of A 3 to the range [0.8, 1.5], which 
is comparable with the precision measurement by using the ratio of cross sections as pointed 
out in [ 6 l] . 

This work is organized as follows. We will describe the event generation of signal and 
background in Section II. Due to the existence of three neutrinos which consist of missing 
energy in events and are unable to be fully reconstructed, we will propose a partial recon¬ 
struction method for the visible objects and analyse the key kinematic features of signal 
events in the 3l2j + $ mode in Section II. By using the constructed kinematic observables, 
we will consider the sensitivity of 14 TeV LHC and a 100 TeV collider in Section IV. We 
will end this paper with discussions and conclusions. 


II. EVENT GENERATION AND KINEMATIC FEATURES OF SIGNAL EVENTS 

We have generated the signal events in the following steps. 1) We have used the leading 
order matrix element computed by MadLoop/aMC@NLO|62| and Gosarn[U3]. which have 
taken into account the top-quark mass dependence in the loop evaluations. We have cross¬ 
checked the generated codes with the matrix element obtained by the FormCalc|64j. and 
have found these independent approaches yielding the same results. 2) We perform the 
integration over the whole phase space by using the VBFNLO code [U3HH7] and obtain the 
total cross section. 3) After reaching a stable total cross section to the desired precision, 
we reweight each events in the phase space so as to yield the unweighted events at the 
parton-level. 

For the LO cross sections, we used CTEQ6L1[5SJ PDF sets. We set the cuts in the 
phase space for the final Higgs bosons as \rj(H)\ < 5 and Pt{H ) > 1 GeV. We set the 
renormalisation and factorisation scales as [i r = /x/ = \/§, and have reproduce the LO total 
cross section as 22.8 fb, which agrees with our previous results [69]. 

Using the unweighted events, we use the package DECAY provided in MadGraph5j70| to 
decay Higgs into a pair of W bosons (one is on-shell and the other is off-shell) and further 
to decay W bosons into quarks and leptons. Therefore, all spin correlation information in 
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the final states have been taken into account in the data sample. Before considering lepton 
and jet and missing energy reconstruction at detector level, we use PYTHIA6[7TJ to perform 
parton showering. 

For the background processes, we use Madgraph5[70] to generate events, and also shower 
it with PYTHIAbjTTJ. In order to avoid the double-counting issue of jets originated from ma¬ 
trix element calculation and the parton shower, we apply the MLM-matching implemented 
in Madgraph5[7p]. In practice, for the background events of ttW, we include both processes 
pp —>■ ttW and pp —>• ttW + j to form an inclusive dataset. For the background events of 
WWW, we include processes pp —>■ WWW and pp —> WWW + j and pp —> WWW + jj, 
and similarly for ZW,HW backgrounds. The ZW ± —¥ l + l~W background are generated by 
using exact matrix element which including off-shell Z and 7 effects, and other backgrounds 
are all generated on mass-shell. We ignore ZZ background due to it require one lepton is 
missing and should be much smaller than ZW background. The background ttZ,ZWW is 
also ignored because it is much(20 30) smaller compared to the ttW,WWW background, 
correspondingly. We also ignore tttt due to its tiny cross section and the efficient rejection 
by b-taggings. We would like to mention that in all background event generation that the 
decay correlation for all final states have been correctly accounted for. 

For the analysis at the detector level, we first reconstruct isolated leptons in each event. 
After that we pass all the rest of visible particles to Fast Jet [72] to cluster to into jets. We 
adopt the anti-kt algorithm m with cone parameter R = 0.4. After that, the transverse 
missing energy is reconstructed. In this study, we have neither taken into account the 
magnetic effects for charged tracks nor the energy smearing effects for leptons and jets. 
Therefore, our analysis should be regarded as a hadron level analysis. 

In order to suppress the dominant background and select the most relevant events, we 
introduce all the following pre-selection cuts at event-by-event level: 

• We veto events with isolated and energetic photon(s) with P t ( 7 ) > 10 GeV and 
1*7(7) I <2-5; 

• In order to suppress the large background from ttW and ttH, we veto events with 
tagged B jets. In our simulation, the tagging efficiency of B jets is assumed 60%. 
Therefore, roughly the background from ttW and ttH can be suppressed by a factor 
0.16. 


7 


D/Dp D/Dp D/Dp 



(a) Pt of 1st lepton (b) Pt of 2nd lepton 
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(f) Invariant mass of two leading jets 


FIG. 1. The transverse momentum of leading three leptons and leading two jets and the invariant 
mass of two jets as well are shown at parton level. Both signal and background are normalized 


one. 


















































• The preselection rule for three isolated leptons is found to be crucial. We demand that 
there are exactly three isolated leptons being found, with the requirement that the first 
leading lepton should have a momentum larger than 30 GeV, the next leading lepton 
should larger than 10 GeV, the softest lepton should larger than 10 GeV. Since there 
must be a lepton coming from the on-shell W boson decay, so we require the leading 
lepton should be hard enough. At the meantime, there must be a softer lepton which 
comes from off-shell W bosons decay. In order to increase the acceptance for signal, 
we deliberately lower the momentum of the third lepton. In Figure (jl^-c), we present 
the distributions of 3 leptons. Considering that threshold of lepton reconstruction is 
around 3 GeV and if lepton is larger than 5-8 GeV at the both CMS and ATLAS 
detectors lepton reconstruction efficiency can be 95% [73J 75], to find the soft leptons 
with P t > 10 GeV in the signal event should be plausible. 

• In order to suppress the large background Z/ylF+jj, we only consider the following 
four modes with two leptons of same sign and same flavor plus an extra different 
flavored lepton: e _ e _ /i + , e + e + /z”, /i+zr+e - , and /i _ /r~e + . After this preselection cut, 
we noticed that the background events from the processes Z/yLF+jj can be safely 
neglected. 

• At least two jets in the events are required to be successfully reconstructed, i.e. rij > 2 
and \r)(j)\ < 2.5. Among those reconstructed jets, there are two jets which could come 
from a W boson either on-shell or off-shell. In order to increase the acceptance of 
signal, we only consider those jets with transverse momentum larger than 15 GeV. 
We show the distributions of these two leading jet in Figure ([lj:l-e) . We also show the 
invariant mass of these two jets in Figure. ©• It is noticed that the invariant mass 
of in signal events can produce two peaks, one is near the value of Mw and the other 
is near that of Mh — Mw- 

• The missing transverse momentum is required to be larger than $ T > 20 GeV due 
to neutrinos in the signal processes. The requirement on large missing energy is also 
useful in order to suppress the huge QCD processes and to save the computing time. 

The LHC detectors can record signal events, which can be triggered by both energetic 
charged lepton and large missing energy. From Table [T[ we observe that the number of 
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processes 

cr LO X branching fraction (fb) 

K factors 

No. Events after preselection cuts 

Signal gg —>• HH 

3.0 x 10~ 2 

1.8 H2j 

16.3 

HW ± 

1.2 

1.2 |76] 

119.4 

WWW 

1.4 

1.8 [77] 

363.9 

ttW ± 

4.6 

1.3 [78] 

451.4 

tiH 

2.1 

1.2 [79] 

101.3 

ZW,jW 

233 

1.8 [80] 

~ 0 

S/B 

0.02 

s/Vb 

0.53 


TABLE I. The number of signal and background events are shown. Here we assume the total 
integrated luminosity as 3 ab -1 . 


background events is around 200 times larger than that of signal events, and it is indeed a 
challenge if we want to distinguish the signal and the background successfully . 

In order to distinguish the signal and background event, we have to resort to the recon¬ 
struction procedure so as to extract the most important information of signal. Since the 
Higgs boson is a neutral particle, for the decay mode £f£f£f2j + E, without considering 
the neutrinos, there are only two possible combinations for a pair of Higgs boson decay: 
(H(£f£^), H(£fjj)) or (H(£f£^),H(£fjj)). For the convenience of later study, hereby we 
label the first Higgs boson as leptonic one (H(££)) and the second one as semileptonic one 

mm)- 

As we can read out from the left panel Figj2j each of Higgs bosons is moderately boosted 
when produced and the peak value of 7 (here 7 is defined as E(H)/run, which is a measure 
to the boost) is around 2. The fraction of highly boosted Higgs boson 7 > 5 is around 13% 
or so, while the fraction of moderately boosted Higgs boson 7 > 1.5 is around 87%. In the 
right panel of Figj2j we show the invariant mass of Higgs pair. It is observed that the peak 
region of the invariant mass of Higgs pair is around 360 — 540 GeV, which explains why 
most of the Higgs pairs are boosted. 
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FIG. 2. The 7 factor (7 = E(H)/mu) and the invariant mass of Higgs pair are shown at the 
parton level. 

III. RECONSTRUCTION OF SIGNAL AND OBSERVABLES 

To know the right combination is crucial to reconstruct the kinematic features of the 
signal and can provide important information to suppress background events. For that 
purpose, we need find a way to determine the right combination reliably. 

A. Determination of the right combination 

Fortunately, the problem at hand is not complicated after the preselection and the number 
of combinations is not formidable. It is observed that in the selected events, there are 
three leptons in total. Two leptons with same sign and same flavour must come from 
different Higgs bosons, there are only two possible combinations for each a signal event. 
The remaining task is to find the right combination by exploiting the kinematics of Higgs 
bosons. 

Keeping those kinematic features of Higgs bosons in signal events exposed in the last 
section, we consider the following six individual methods by using different observables to 
pick out the combination from two as a solution for each event. In Table [17] we tabulate the 
principal observables and the percentage of correctness to pick out the right combinations at 
parton level, which serves as an important guide for our later analysis at hadronic/detector 
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Methods 

The percentage of correctness (%) 

1 m H(ll) ~ m H(ljj) 1 

68.9 

A R(l±,F) 

85.0 

Ai?(Z ± , Wjj) 

89.9 

P t [H(ll)] + P t [H(ljj)] 

90.3 

A R (H(ll),H(ljj)) 

92.0 

m H(ii) + rriH(ijj) 

95.4 


TABLE II. The principal observables to choose the right combination in six methods and the 
percentage of correctness at the parton level are tabulated. 

level. Below, we examine the efficiency of these six methods one by one. 

1 In the first method, we utilize the fact that the mass of the Higgs bosons in the pair 
production must be the same. But due to the missing energy carrying away by the 
neutrinos, if we use the condition that the mass difference should be smaller we find 
that we can only reach the right combination in 69%. 

2 In the second method, we use the fact that most of Higgs bosons are moderately 
boosted and two leptons from the leptonic Higgs are tended to be close in spatial 
separation due to the spin correlation of W bosons from Higgs decayJHIl E5], therefore 
two leptons from it decay should have a smaller angle separation A R(£ + ,£~). We 
notice that by using this observable, the right combination can be determined by 85%. 

3 In the third method, we use the semileptonic Higgs as a guide by requiring the smaller 
angle separation A R(£, Wjj ) between a lepton and a hadronic W. Due to the smaller 
energy loss from its decay, we observe a higher percentage in choosing the right com¬ 
bination when compared with the second method, which can reach to 90%. 

4 In the forth method, we resort to the scalar sum of the transverse momenta of Higgs 
pair (without taking into account the missing transverse momentum), which should 
be large due to the energy conservation in the transverse direction. When the wrong 
combination is made, the scalar sum is found decrease. The method can have similar 
performance as the third method. 
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5 In the fifth method, we exploit the fact that two Higgs bosons mostly fly back-to- 
back in 3d space, therefore the angular separation of them A R(H(££), H(£jj)) = 
a J A cf) 2 + At] 2 should be large. For the two possible combinations, we choose the one 
which yields the larger A R(H(££), H(£jj)) as the solution and we observe that this 
method can arrive at an efficiency 92%. 

6 In the sixth method, we compute the invariant mass of each Higgs boson from visible 
objects, which can be labelled as rriH(U) and rn , respectively. Then we sum these 
two masses rn Him + m H{£jj)- We choose the combination which yield a smaller value 
as the solution. We notice that this method reach the highest percentage of correction 
combination up to 95%. 

Therefore, in the following analysis, we will use the sum of invariant masses of Higgs 
bosons, i.e. the sixth method, to determine the combination and extract the relevant exper¬ 
imental observables at hadron/detector level. 

Another remarkable aspect is the missing energy, or more precisely the missing transverse 
momentum. In signal events, there are three neutrinos in total, which should have 9 degree 
of freedom to determine the full phase space. But we can obtain have at most 5 constraints. 
So in principle, it is impossible to solve the kinematics at event-by-event level. Nevertheless, 
in the hypotheses of pair production, we can split the transverse missing momenta of each 
event into two parts. The first part will combine with the lepton pair to reconstruct the 
transverse mass of the first Higgs boson, and the second part should combine with the rest 
of objects in the event to form that of the second Higgs boson. Below we will explore the 
variable 777.^2 • 

B. The variable rriT2 

The variable, rriT, the transverse mass of W boson has played a crucial role for the discov¬ 
ery of W boson[83]. The extended variable was introduced to extract the information of 
particle mass in pair production processes at hadron colliders [Ml [55] when the information 
of both the mass and longitudinal components of invisible particles are missing. 

The original setup assumed the production of a pair of particles A\ and A 2 , then particles 
Ai decay into invisible particle Bi and visible particle C*, for example: pp —> A 1 A 2 ,A 1 —> 
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B\Ci, A 2 —>■ B 2 C 2 , where Bi particles denote invisible particles like neutrinos and neutralinos 
of the SUSY and Q particles denote visible particles like leptons or jets of which the energies 
and momenta can be reconstructed by detectors. At hardon colliders, only the sum of 
transverse momentum of B\ and B 2 which is denoted as can be reconstructed by assuming 
the energy conservation in the transverse directions. In experiments, the missing transverse 
momenta $ T can be reconstructed by using the particle flow algorithm, for instance. Since 
the energy and longitudinal components are missing and in principle it is impossible to 
reconstruct the mass of m^, but we can define the transverse mass of particle A from the 
transverse momenta the particles B and C: M^(P T (B ), P T (C)) = (E T (B) + E T (C)) 2 — 
(Pt(B) + Pt(C)) 2 , where the transverse energy is defined as E\ = P%, + M 2 . There exists 
an inequality Mt(B,C) < M(A). 

In practice, to construct the variable rriT 2 we split the missing transverse momenta into 
two parts and to find the minimal of the maximal of reconstructed transverse mass: 


m T 2 = 


mm 

Pti+Pt2=1 


max 


m* (P T (Si), PriC,)), m 2 T {P T (B 2 ), P T {C 2 )) 


For each an event, the minimization is taken over all possible transverse momentum split¬ 
ting. For a pair production event, the 777,^2 corresponds to find the solution where both the 
reconstructed transverse masses from each decay chain are equal. Recently, there are more 
studies on the rriT 2 variable and its variants, interested readers can refer [86b88] for more 
information. 

Obviously, this variable can be generalized to the cases where either particles B or C are 
not a single particle then either Ai or A 2 can decay into different final states. In the case 
at our hand, the leptonic Higgs boson decays into (/ ± / =F )(nz/), and the semi-leptonic Higgs 
decays into (l ± jj)(v). In the invisible part of the leptonic Higgs contains two neutrinos and 
their invariant mass is unable to know. Considering that the variable is a monotonous 
increasing function on the 777 . 12 , for simplicity, we choose it as zero. 

For the case at our hand, after the splitting of missing transverse momenta, we can 
construct the transverse mass of Higgs boson by using the m T2 code [52] • So the first part 
of the split $ T should correspond to the combination of two neutrinos, and the second part 
of fjrp should correspond to a neutrino. So that the transverse mass of Higgs boson can be 
constructed. The quantity is called as the rriT 2 variable, which has utilised information of 
both the visible and invisible objects in an event. It is remarkable that this quantity is the 
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most sensitive observable to distinguish signal and background, as shown in both Fig. (|3ji) 
and Table (III). 




(a) Mass of leptonic Higgs (b) Mass of semileptonic Higgs 




(c) The reconstructed W/W* mass (d) tot 2 

FIG. 3. Four crucial reconstructed kinematic observables at hadron level are demonstrated. 


In Figure (J3]) , we show the line shapes of signal and background events in terms of 
A R(£,jj), rriH(£jj), and From the line shapes, we introduce a one-dimensional cut for 


each of these observables. In Table (III), we tabulate the efficiency of each cut. It is noticed 
the observable rriT 2 can have the best distinguishing power and the observable rriH(u) is the 


second powerful observable. From Table (III), it is remarkable that the backgrounds from 
ttW and WWW can be heavily affected by this quantity since they are not pair production 
processes in nature. While for the process HW, extra jets from initial state radiation should 
be used to balance the pair production hypothesis. 

When we combine all of these cuts into a cut-based method, we arrive at the significance 
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signal H H 

t t + W 

H W 

WWW 

t t H 

S/B 

s/Vb 

No. after preselection 

13.7 

317.2 

94.4 

400.6 

101.3 

1.5 x 10~ 2 

0.45 

rrijj < 80 GeV 

10.6 

153.7 

53.3 

189.6 

78.7 

2.2 x 10 -2 

0.49 

) <110 GeV 

9.6 

70.6 

27.8 

78.2 

54.8 

4.2 x 10 -2 

0.63 

rn H (U) < 55 GeV 

11.2 

76.9 

65.0 

92.8 

53.6 

3.9 x 10 -2 

0.66 

tut 2 <110 GeV 

8.4 

18.4 

16.7 

19.1 

27.1 

1.0 x 10” 1 

0.93 


TABLE III. The efficiency of four crucial cuts are demonstrated. To appreciate the efficiency of 
each cut, we also provide the values of S/B and S/VB after each a cut. 



signal H H 

t t “h W 

H W 

WWW 

t t H 

No. after preselection 

13.7 

317.2 

94.4 

400.6 

101.3 

rriT2 <110 GeV 

8.4 

18.4 

16.7 

19.1 

27.1 

m H(li) < 55 GeV 

7.2 

10.5 

13.0 

11.5 

19.0 

No. of jets <= 4 

6.2 

8.0 

12.0 

8.8 

7.9 

S/B 

0.17 

s/Vb 

1.0 


TABLE IV. The effects of each cut in the cut-based method are demonstrated in a sequential way. 
After all cuts, the values of S/B and S/^B are provided. 


given in Table (IV). After using the quantities extracted from our reconstruction procedure, 


we notice that the S/B can be improved by a factor 10 or so. Compared with the results 
given in Table (JlJ) , we observe a big gain in both S/B and S/s/B. The gain is mainly 
yielded by the success of suppression to the background processes ttW and WWW. In 
contrast, the suppression to the background HW is relatively limited due to the appearance 
of a real Higgs boson in the process and our reconstruction procedure can find the Higgs 
bosons in the events. For example, the cut rri h(M) < 55 GeV has no serious effects to this 
background process. But, instead, the variables from semi-leptonic Higgs can impose a 
significant suppression to this type of background. 
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after preselection cuts 

Cut-based method 

MLP method 

BDT method 




Nnn > 0.82 

Nbdt > 0.41 

No. of Signal 

13.7 

6.2 

5.7 

3.8 

No. of Background 

913.5 

36.8 

21.7 

6.2 

S/B 

1.5 x 10 -2 

1.7 x 10 _1 

2.6 x 10' 1 

6.2 x 10” 1 

s/Vb 

0.45 

1.0 

1.2 

1.5 


TABLE V. Comparison of significance among three analysis methods are shown. 

IV. THE SENSITIVITY TO TRIPLE HIGGS COUPLING 

A. The sensitivity to A 3 at LHC 14 TeV with a 3 ab 1 dataset 

Considering that the number of signal event is few and the number of phase space of 
the final states is 24 (there are 9 dimensional space contributing to the missing energy in 
signal events) and most of variables are correlated, we optimize these cuts and include more 
observables which are independent of those four observables in the cut-based method. We 
have included more kinematic observables in our analysis: 

• The sum of transverse momenta of all objects used in the reconstruction procedure is 
considered, of which the distribution of signal and background are shown in Fig. (|4^). 

• The transverse momenta of the harder jet used to reconstruct the hadronic W/W* 
is taken into account and is shown in Fig. (|4 Jd) . Due to the existence of off-shell W 
bosons, the momentum is softer than the background events. 

• The invariant mass of all the visible objects (including 3 leptons and all jets) is pre¬ 
sented in Fig. (J 4 J 3 ) , we observe that the signal events typically have smaller values 
when compared with the background events. 

• The transverse mass obtained from the combined 4-momentum of three leptons and 
the missing transverse momentum is shown in Fig. (|4jl). 

We have also exploited other observables, like the transverse momenta of leptonic and serni- 
leptonic Higgs, the angular separation of two partially reconstructed Higgs bosons, the 
number of jets in each event, the ratio of missing energy over the visible energetic, etc. 
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(a) Pt Sum of objects used for reconstruction (b) Pt of harder jet in the hadronic W boson 




(c) Invariant mass of visible objects (d) The transverse mass of leptons and 

FIG. 4. We show more observables at hadron level used by the multivariable analysises. 


We apply two multivariable analysis’s: one is BDT, the other is MLP neural network. 
The distributions of other useful observables are shown in Fig. Q. 

The results of multivariable analysis are presented at Table |V]) and the distributions of 
discriminant response to signal and background are shown in Figure (J 6 ]). We observe that 
the S/B can be improved by a factor 20 and the significance can reach up to 2.0 or so, which 
are very encouraging. 


We plot the estimated sensitivity to A 3 at LHC 14 TeV with a 3 ab 1 dataset in Fig. 


( 6 (a)). Although there are the large number of background events, we are capable to rule 
out the value of A 3 < — 1.0 and A 3 > 8 . 0 ; while if A 3 is within the range — 1.0 < A 3 < 8 , it 
might be challenging to determine the value of A 3 due to background fluctuations. 
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(a) MLP Neutral Network (b) Boosted Decision Tree 

FIG. 5. The response of the discriminants to signal and background in two Multivariable Analysises, 
the MLP NN and BDT methods, are demonstrated. 


Sensitivity at LHC 14 TeV with a 3000/fb dataset 



(a) Sensitive to A 3 at 14 TeV LHC 

FIG. 6 . The sensitivity to the triple couplings 
TeV collider is shown. 



-1 012345678 


(b) Sensitivity to A 3 at a 100 TeV collider 
Higgs boson, A 3 , at the LHC 14 TeV and a 100 


B. The sensitivity to A 3 at a 100 TeV Collider 


We apply our analysis demonstrated in the last section to a 100 TeV collider. It is noticed 
that both the production rate of signal and background with top quarks enhanced by a factor 


40 or more than 100. In Table (VI), we tabulate the results obtained from the cut-based 


method and find that the significance can reach to 7.0. 

When compared with the 14 TeV collider case, it is remarkable that the background 
ttH becomes the dominant one after all cuts. Here we haven’t applied any special variable 
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to further suppress this type of background, we expect better results when these special 
variables of ttH are used. Another remarkable fact is that although the production rate of 
the background from tttt also enhances by a factor 200 , simply by counting the number of 
jets can efficiently kill most of this type of background. 



signal H H 

t t + W 

H W 

WWW 

ttH 

tttt 

No. after preselection 

416.8 

4392.3 

716.1 

4384.1 

5045.9 

263.48 

rriT 2 <110 GeV 

234.3 

234.6 

116.8 

125.4 

1152.9 

26.8 

%(«) < 55 GeV 

202.3 

133.9 

94.8 

71.6 

811.6 

15.5 

No. of jets <= 4 

160.0 

81.8 

82.4 

53.5 

304.9 

1.0 

S/B 

0.31 

s/Vb 

7.0 


TABLE VI. The effects of each cut in the cut-based method are demonstrated in a sequential way 
for a 100 TeV collider. After all cuts, the values of S/B and S/y/~B are provided. 


In Table. (VII), we tabulate the optimised results. Similarly to the 14TeV case, we 
notice that the significance can be improved by a factor of 3.5 or so and the ratio S/B can 
be improved by two order. 



after preselection cuts 

Cut-based method 

MLP method 

BDT method 




Nnn > 0.94 

Nbdt > 0.22 

No. of Signal 

416.8 

160.0 

80.4 

104.0 

No. of Background 

14801.8 

523.6 

107.3 

67.1 

S/B 

2.8 x 10 -2 

3.1 x 10 -1 

7.5 x 10 - 1 

1.5 

s/Vb 

3.43 

7.0 

7.8 

12.7 


TABLE VII. Comparison of significance among three analysis methods in a 100 TeV collider are 
shown. 


The sensitivity of a 100 TeV collider to the triple coupling A 3 is provided in Fig. ( 6 (b)). 
So a 100 TeV collider can exclude all values of A 3 by simply using the 3 leptons mode 
considered in this work. Here our multivariate analysis has been optimised for the SM, i.e. 
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A 3 = 1, all A 3 out the range [2.8, 4.5] can be discovered. Nonetheless, if we optimise our 
analysis to different A 3 , we notice that even for the minimal cross section case with A 3 = 3.6 
or so, the significance can reach to 5cr. 

Since there is no doubt that the SM triple Higgs coupling can be discovered at a 100 
TeV collider, below we concern the issue how well this coupling can be measured by just 
using the trilepton mode considered in this work. To address this issue, we use the invariant 
mass of three leptons to perform a y 2 analysis. The distributions of this variable after the 
preselection cuts and the multivariate analysis are shown in Fig. (J7|). We have deliberately 
chosen three different values of A 3 to demonstrate the differences in magnitudes and shapes. 
Since all cuts are optimised to the SM case, one can perceive the signal shapes of the cases 
A 3 = —2 and A 3 = 5 have been greatly changed by the MVA filter. 




(a) Three leptons mass after preselection (b) Three lepton mass after MVA cut 

FIG. 7. The distributions of the invariant mass of three lepton after preselection and after MVA 
cut are demonstrated, where the background events have been neglected. Three different values of 
A 3 are chosen to show how the triple Higgs coupling can affect the magnitudes and shapes. 


Below we address the issue how well the value of A 3 can be determined. By using the 
method described in Refs. | 6 Ql ED], we use 10 bins to perform a y 2 analysis on the distribu¬ 
tions of the invariant mass of three leptons. The expression for y 2 is given by 


X (A 3 ) — 2_ -Wo- \ n D ~ 1) j 


2—1 


m 


( 2 ) 


where np denotes the number of bins, and N t is the number of events which include both the 
signal and background after cuts. Obviously N t is dependent upon the triple Higgs coupling 
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parameter A 3 . N® means the number of events in the SM in the i-th bin after cuts, here the 
SM means A 3 = 1. Here the quantity / encodes the uncertainty in the normalisation of the 
SM cross section within the allowed range, which is determined by minimising y 2 


/ 


(1 + AN)- 1 for / < (1 + AN)~\ 
f for (1 + AN)- 1 < f < 1 + AN, 

1 + A/ for / > 1 + AN , 




where AN is taken as 10% of the SM cross section (including both the signal and background 
after all cuts). The parameter / is defined as 


P = 

The results are presented in Fig. (| 8 |) 
There are two comments in order: 


i=i * i= 1 

where the common term no — 1 has been omitted. 


In the Fig. ( 8 (a)), we observe that a 100 TeV collider can distinguish the two cases 
A 3 = 1 and A 3 = 6.3, although total cross sections of these two cases are equal. 
The difference in the lineshapes of the invariant mass of three leptons is sufficient to 
separate them from each other. 

To appreciate the underlying reason why these two cases are separable, at leading 
order, we can represent the differential cross section in the following form as 

d 2 cr 


= (A 3 C A + C n ) 2 + D 2 U , 


(3) 


ds dcosd 

where the Ca and Cu are S — 0 form factors and Do is S = 2 form factor, which are 
dependent upon s and cos 0 and their exact expressions at leading order can be found 
in pH]. In term of this form, total cross section can be expressed as 


if = \\C\ + 2 A 3 C A C D + C 2 + D 2 , (4) 


where all overlined quantities, like C\, CaC\j, and C □ + D^, denote the integrated 
values, which are just numbers. When total cross section is fixed as a by experimental 
measurements, there are two solutions of A 3 which can yield the same total cross 
section. These two solutions can be expressed below as 


A 


± 

3 


C A Co ± ^(a-Ch-D^Cl-CACn 

W 


(5) 
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From these two solutions, the difference of differential cross section between them can 
be expressed as 


d 2 a + d 2 a 

ds d cos 6 ds d cos 9 


2(-V - U) Ca (Ch - a) ■ 


( 6 ) 


It is noticed that two form factors, i.e. C& and C*n, can completely determined the 
difference of lineshapes of these two cases. 


In the Fig. ( 8 (b)), we observe that by using the y 2 fit of the invariant mass of three 
leptons, the value of A 3 can be determined as IIqJ in the 95% confidence level, which is 
wider than the value 11 q ' 2 or so when only the statistic accuracy is taken into account. 
But if the total luminosity of can reach to 30 ab _1 , it is possible to reach this precision 
or better. This result is comparable to the precision which might be achieved from 
both the signals of 6677 final state and those of 6677 with a hard jet 




6 8 
V^SM 


0.5 


1.5 


2.5 3 


(a) x 2 varying with A 3 (b) Fine structure of y 2 near A 3 = 1 

FIG. 8 . The y 2 varying with A 3 at a 100 TeV collider with a dataset 3000 fb -1 is shown. 


Since observables constructed from leptons are expected to be more robust than those 
constructed from jets due to the contamination of underlying events and pileup of high 
luminosity run at future colliders, so we have used the invariant mass of three leptons to 
perform the y 2 fit. We can use the lineshape of the transverse momenta of leptonic Higgs, 
the invariant of leptonic Higgs, etc, to perform alternative y 2 analysis. It is noticed that 
they can yield the similar results. 
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V. DISCUSSION AND CONCLUSION 


In this work, we have considered the feasibility of 3£2j + $ mode to discover the signal 
of Higgs pair production at the LHC and a 100 TeV collider. We have proposed a partial 
reconstruction procedure to reconstruct two Higgs bosons in the final state and have exam¬ 
ined the rriT 2 observable in the hypothesis of pair production to discriminate the signal and 
background events. Although the production rate of signal events at the LHC 14 TeV 3000 
ftr 1 is small, we have found that this mode can yield a significance around 1.5 or so. For 
a 100 TeV collider with the same integrated luminosity, we noticed this mode can be used 
to determine the A 3 and triple coupling of the Higgs boson of the standard model can be 
determined into the range IIqJ. If the total integrated luminosity is 30 ab _1 , we estimated 
that it is possible to achieve I+q 

At our hadron level simulation, in order to take more signal into account, we have de¬ 
liberately chosen A R(£) > 0.2 to find isolated leptons; after having found these objects, 
we cluster the rest of particles and energy into jets. We notice that the angular separation 
cuts between leptons and that between a lepton and a jet can affect the selection efficiency 
of signal events to a quite considerable degree. Therefore, we provide the distributions of 
A R min (£J) and A R min (l,j) in Fig. 




FIG. 9. For the signal events at the hadron level, we show the minimal angle separation between 
two leptons (the left panel) and the minimal angle separation between a lepton and a jet (the right 
panel) here. 


In this work, we have used the lepton isolation criteria A R(£) = 0 . 2 , which is possible 
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when the fine granularity of tracker detector and electromagnetic calorimeter are taken into 
account. By changing this condition to A R(£) = 0.3, we observe that the signal loss is 
around 10%. When we demand R mm (£, j) < 0.4, we notice that the signal loss is around 
15%. 

We haven’t included more detailed detector effects, like the pileup effects which may 
mitigate the reconstruction of two soft jets coming from the off-shell W decay. For a more 
realistic 100 TeV collision study, the pileup effects can be a serious issue [93] due to the fact 
that we need identify two jets in the signal event, which deserves our future careful study. 

We have used the B veto and have assumed B tagging efficiency as 0.6. If the B tagging 
efficiency is assumed to be 0.7 and more background events from ttW can be better rejected, 
then we expect a better realistic significance. Beside, information of color-flow of two jets 
from W/IF*, which are colour singlet objects, can be used to determine the right combination 
and may provide further improvement. Considering these potential further improvements 
for this mode and, in contrast to the contamination of pileup effects and underlying events 
which might mitigate the modes with B jets in the final states for the 6677 mode [94], we 
believe this mode might be robust and promising and should be seriously considered by 
experimenters. 

We can extend this work to study the same-sign dilepton modes of the Higgs pair pro¬ 
duction at both the context of the LHC and a 100 TeV collider. I 11 a 100 TeV collider, 
the production rate of Higgs pair can be quite significant, we can extend the partial re¬ 
construction method and analysis demonstrated here to the four-leptonic mode of Higgs 
pair production, which should be clean and robust against the contamination of underlying 
events and pileup effects. 
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